NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

STZ: A High Quality and High Speed Streaming Lossy Compression Framework for Scientific Data

https://doi.org/10.1145/3712285.3759795

Wang, Daoce; Grosset, Pascal; Pulido, Jesus; Tian, Jiannan; Athawale, Tushar; Jia, Jinda; Sun, Baixi; Zhang, Boyuan; Jin, Sian; Zhao, Kai; et al (November 2025, ACM/IEEE)

Free, publicly-accessible full text available November 15, 2026
COMPSO: Optimizing Gradient Compression for Distributed Training with Second-Order Optimizers

https://doi.org/10.1145/3710848.3710852

Sun, Baixi; Liu, Weijin; Pauloski, J Gregory; Tian, Jiannan; Jia, Jinda; Wang, Daoce; Zhang, Boyuan; Zheng, Mingkai; Di, Sheng; Jin, Sian; et al (February 2025, ACM)

Free, publicly-accessible full text available February 28, 2026
A Survey on Error-Bounded Lossy Compression for Scientific Datasets

https://doi.org/10.1145/3733104

Di, Sheng; Liu, Jinyang; Zhao, Kai; Liang, Xin; Underwood, Robert; Zhang, Zhaorui; Shah, Milan; Huang, Yafan; Huang, Jiajun; Yu, Xiaodong; et al (May 2025, ACM Computing Surveys)

Error-bounded lossy compression has been effective in significantly reducing the data storage/transfer burden while preserving the reconstructed data fidelity very well. Many error-bounded lossy compressors have been developed for a wide range of parallel and distributed use cases for years. They are designed with distinct compression models and principles, such that each of them features particular pros and cons. In this paper we provide a comprehensive survey of emerging error-bounded lossy compression techniques. The key contribution is fourfold. (1) We summarize a novel taxonomy of lossy compression into 6 classic models. (2) We provide a comprehensive survey of 10 commonly used compression components/modules. (3) We summarized pros and cons of 47 state-of-the-art lossy compressors and present how state-of-the-art compressors are designed based on different compression techniques. (4) We discuss how customized compressors are designed for specific scientific applications and use-cases. We believe this survey is useful to multiple communities including scientific applications, high-performance computing, lossy compression, and big data.
more » « less
Free, publicly-accessible full text available May 2, 2026
Multifacets of lossy compression for scientific data in the Joint-Laboratory of Extreme Scale Computing

https://doi.org/10.1016/j.future.2024.05.022

Cappello, Franck; Acosta, Mario; Agullo, Emmanuel; Anzt, Hartwig; Calhoun, Jon; Di, Sheng; Giraud, Luc; Grützmacher, Thomas; Jin, Sian; Sano, Kentaro; et al (February 2025, Future Generation Computer Systems)

Free, publicly-accessible full text available February 1, 2026
Concealing Compression-accelerated I/O for HPC Applications through In Situ Task Scheduling

https://doi.org/10.1145/3627703.3629573

Jin, Sian; Di, Sheng; Vivien, Frédéric; Wang, Daoce; Robert, Yves; Tao, Dingwen; Cappello, Franck (April 2024, ACM)

Full Text Available
Machete: An Efficient Lossy Floating-Point Compressor Designed for Time Series Databases

https://doi.org/10.1109/DCC58796.2024.00061

Shi, Yang; Zou, Xiangyu; Chen, Xinyu; Jin, Sian; Tao, Dingwen; Deng, Cai; Chen, Yufan; Xia, Wen (March 2024, IEEE)

Full Text Available
High-performance Effective Scientific Error-bounded Lossy Compression with Auto-tuned Multi-component Interpolation

https://doi.org/10.1145/3639259

Liu, Jinyang; Di, Sheng; Zhao, Kai; Liang, Xin; Jin, Sian; Jian, Zizhe; Huang, Jiajun; Wu, Shixun; Chen, Zizhong; Cappello, Franck (March 2024, Proceedings of the ACM on Management of Data)

Error-bounded lossy compression has been identified as a promising solution for significantly reducing scientific data volumes upon users' requirements on data distortion. For the existing scientific error-bounded lossy compressors, some of them (such as SPERR and FAZ) can reach fairly high compression ratios and some others (such as SZx, SZ, and ZFP) feature high compression speeds, but they rarely exhibit both high ratio and high speed meanwhile. In this paper, we propose HPEZ with newly-designed interpolations and quality-metric-driven auto-tuning, which features significantly improved compression quality upon the existing high-performance compressors, meanwhile being exceedingly faster than high-ratio compressors. The key contributions lie as follows: (1) We develop a series of advanced techniques such as interpolation re-ordering, multi-dimensional interpolation, and natural cubic splines to significantly improve compression qualities with interpolation-based data prediction. (2) The auto-tuning module in HPEZ has been carefully designed with novel strategies, including but not limited to block-wise interpolation tuning, dynamic dimension freezing, and Lorenzo tuning. (3) We thoroughly evaluate HPEZ compared with many other compressors on six real-world scientific datasets. Experiments show that HPEZ outperforms other high-performance error-bounded lossy compressors in compression ratio by up to 140% under the same error bound, and by up to 360% under the same PSNR. In parallel data transfer experiments on the distributed database, HPEZ achieves a significant performance gain with up to 40% time cost reduction over the second-best compressor.
more » « less
Full Text Available
Scientific Error-bounded Lossy Compression with Super-resolution Neural Networks

https://doi.org/10.1109/BigData59044.2023.10386682

Liu, Jinyang; Di, Sheng; Jin, Sian; Zhao, Kai; Liang, Xin; Chen, Zizhong; Cappello, Franck (December 2023, IEEE)
LibPressio-Predict: Flexible and Fast Infrastructure For Inferring Compression Performance

https://doi.org/10.1145/3624062.3625124

Underwood, Robert R; Di, Sheng; Jin, Sian; Rahman, Md Hasanur; Khan, Arham; Cappello, Franck (November 2023, ACM)

Full Text Available
Efficient MIMO PHY Abstraction With Imperfect CSI for Fast Simulations

https://doi.org/10.1109/LWC.2022.3233542

Cao, Liu; Zhang, Lyutianyang; Jin, Sian; Roy, Sumit (March 2023, IEEE Wireless Communications Letters)

Full Text Available

« Prev Next »

Search for: All records